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(54) Method and system for information retrieval 



(57) The method and system for retrieving informa- 
tion allow the user to retrieve desired information from 
database systems by simply specifying retrieval content 
and retrieval conditions using languages familiar to the 
user, without having to know the names for the relevant 
database systems or their structures or without having 
to interact with the platform of the meta-information. i. 
e., information concerning the use of database systems 
and accessing method. The present method allows re- 
trieval of information from different database systems 
connected to a communication network, by using refer- 
ence information stored beforehand to access different 
database systems: consulting the reference information 



to determine database systems which contains data to 
satisfy the information retrieval request by converting re- 
quested items to equivalent related words that the sys- 
tem can recognize. The storage locations of the data 
and instructions on how to acquire the data are deter- 
mined using relational items to link various tables so that 
an information retrieval statement can be prepared by 
the system program on user's behalf. The databases are 
searched according to the acquiring method and the in- 
formation retrieval content described in the system pre- 
pared statement, and the retrieved information is pre- 
sented for viewing in a format that is used by the infor- 
mation searcher. 
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Description 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates in general to methods 
and systems for retrieving information, and relates in 
particular to a method for retrieving information, speci- 
fied by an information searcher, from a plurality of dif- 
ferent database systems which are connected to a com- 
munication network. 

Description of the Related Art 

Searching and retrieving information in a database 
system is carried out by first specifying a target data- 
base residing in certain meta-information (known as 
metadata or schema information, i.e., information on the 
use of the database and preparing a structured query 
language (SQL) for) to describe what table to be 
searched, under what search conditions and what type 
of information to be retrieved. 

When searching for information in a number of da- 
tabases connected to a communication network, there 
are search systems that give an impression that the 
search is being carried out within one database, by hav- 
ing the database manager integrate all the separate 
sections of meta-information from applicable databases 
so that there is no need for the information searcher (us- 
er) to be aware of the existence of separate databases 
or their retrieval methods to carry out searches through 
different databases. 

When communicating with databases distributed 
over several database systems, there are also known 
methods to work with a structure of the meta-information 
which is common to all the database systems, so that 
the user's request is executed by generating a common 
request to all such systems, and the search results are 
displayed in a summary table; or when the content of 
the retrieval information has been decided or when the 
structure and format of the specified information are giv- 
en in the retrieval conditions, the specified search con- 
tent can be analyzed and the target database systems 
can be selected to lead to the execution of specific 
search commands. 

However, integrated meta-information managed by 
such conventional multi-dalabased systems connected 
to a number of databases are strictly dependent on the 
data managed by some database management system 
(DBMS) and are avaiiaDle as an interface to particular 
sets of data. In using such a retrieval system, the user 
must be cognizant of the type of data stoked, their struc- 
tures and the format. 

Furthermore, there is no assurance that the infor- 
mation available througn general meta-information plat- 
forms are grouped after interpreting the meaning of the 
stored data, because data are often selected on the ba- 



sis of ease of programming or to maintain some consist- 
ency in daiabase tables. This situation presents a prob- 
lem because it is difficult for the user to anticipate the 
type of information actually being stored unless the user, 
5 searching through databases connected to a network, 
is relatively familiar with the contents of information de- 
scribed in the meta-information. 

Further, when searches are conducted through a 
plurality of distributed database systems, it is necessary 

^0 that the different meta-information be built on an identi- 
cal structure, for the retrieval to be successful. Addition- 
ally, when working in such a distributed information sys- 
tem, it must be remembered that the system manager 
is only capable of searching for a set of request items 

*5 or words recognizable by the DBMS and search results 
are then reported to correspond to the common items. 
Therefore, it is difficult for the system manager to re- 
spond to individualized request items of the user. Al- 
though the user can specify certain database systems 

20 to be searched with specific search commands, when 
the structures and formats of the required information 
are specified in search content or in search conditions, 
this is possible only if all the database systems of inter- 
est share strictly identical structures, because the exist- 

25 ing search methodologies do not permit the user to re- 
trieve information from database systems having differ- 
ent structures. 

Therefore, using the conventional information re- 
trieval systems based on present meta-information or 

30 schema information offered by an integrally managed 
platform, the user is required first to interact with an in- 
tegrated platform of meta-information provided by some 
DBMS, and then to enter search commands directly ap- 
plicable to relevant database structures. As the number 

35 of database systems connected to a network increases 
in the future, it becomes increasingly difficult for the user 
to first understand the contents of the large number of 
database systems, and then to describe search com- 
mands and specify necessary conditions to obtain de- 
sired information. 

With further advancement anticipated in the com- 
munications network, there is a growing desire to con- 
nect various database systems to the network so as to 
more effectively utilize data stored in various database 

-*5 systems. However, even if a technical environment is 
created so that a large number of databases become 
accessible in the network, the user still faces a situation 
of vast amounts of information exceeding his ability to 
develop proper understanding of the potential inferma- 

50 tion available, resulting in difficulties in retrieving desired 
information from a widely distributed databases. 

SUMMARY OF THE INVENTION 

55 it is an object of the present invention to provide an 
information retrieval system to enable an information 
searcher or user to carry out a thorough and effective 
retrieval from a plurality of different databases managed 
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by different management systems, without having to in- 
teract with integrally managed meta-information, sche- 
ma information or to specify technical requirements for 
the operation of different database systems, such as the 
locations and structures of the plurality of databases. 

The object has been achieved in a method for re- 
trieving requested information, comprising the steps of: 
storing reference information beforehand to access a 
plurality of database systems; consulting the reference 
information upon receiving the information retrieval re- 
quest; determining a database system which contains 
data to satisfy the information retrieval request as well 
as a storage location of the data, and obtaining instruc- 
tions relating to an acquiring method for retrieving data 
from the database system: preparing an information re- 
trieval statement in conformity with the storage location 
and the acquiring method: searching through the data- 
base system for requested information according to the 
information retrieval statement and the acquiring meth- 
od: and presenting retrieved information to an informa- 
tion searcher. 

Accordingly, the present method of retrieving infor- 
mation allows the user to retrieve desired information 
from a plurality of different storage locations and data- 
base systems, by simply specifying retrieval content and 
retrieval conditions using words that are familiar to the 
user without the need for the user to interact with the 
integrated meta-information platform which manages 
different database systems or to know the names for da- 
tabases or the structures of different databases. 

Also, the method is most preferably practiced by us- 
ing an information retrieval system having a communi- 
cation network, a plurality of different database systems 
connected to the communication network, and retrieval 
means for retrieving information in response to an infor- 
mation retrieval request entered by an information 
searcher, and comprising: reference information stor- 
age means for storing access requirements for access- 
ing the plurality of different database systems: storage 
location information retrieval means for retrieving infor- 
mation, concerning database locations, database struc- 
tures and database format to satisfy the information re- 
trieval request, from the reference information storage 
means: method retrieval means for obtaining instruc- 
tions relating to acquiring methods for retrieving infor- 
mation specified in the information retrieval request from 
the database systems which contain information speci- 
fied in the information retrieval request: and information 
retrieval means for determining relevant aatabase sys- 
tems according to information obtained from the storage 
location information retrieval means, and. in conformity 
with the acquiring method, for retrieving information 
specified in an information retrieval content included in 
the information retrieval request. 

Accordingly the present information retrieval sys- 
tem allows the user to retrieve desired information from 
a plurality of storage locations and dataoase systems 
dv simply specifying retrieval content and retrieval con- 



ditions using words familiar to the user, without having 
to know the names for the relevant databases or their 
system structures or to interact with the meta-informa- 
tion platform. 

s The above method and the information retrieval 

system presented above are further enhanced by using 
a recording medium, readable by computer means, hav- 
ing pre-recorded information resource dictionary data: 
wherein , the information resource dictionary data are re- 

io corded in: a column information file for managing col- 
umns of requested items specified in tables; a table in- 
formation file for managing tables contained in each da- 
tabase: a database information file for managing loca- 
tions of each database; and a database management 

*s system file, known as a DBMS file, for managing dedi- 
cated information for methods of acquiring requested 
items from each database; and wherein 'the column in- 
formation file has an allocation for recording column 
names contained in tables in relation to table titles so 

20 as to relate column names to requested items specified 
in the information retrieval request: the table information 
file has an allocation for recording table titles contained 
in databases in relation to database titles so as to relate 
the column information file to the table information file 

2S through table titles: the database information file has an 
allocation, for each of databases connected to the com- 
munication network, for recording database titles, host 
names indicating locations of databases and DBMS 
names for each database being managed by a DBMS, 

30 so as to relate the table information file and the database 
information file through database titles: and the dedicat- 
ed DBMS file has an allocation, for each of DBMSs. for 
recording DBMS names in relation to dedicated infor- 
mation for each DBMS, so as to relate the database in- 

3S formation file and the dedicated DBMS file through da- 
tabase titles. 

Accordingly, the recording medium having the infor- 
mation resource dictionary data allows the user to spec- 
ify retrieval conditions and retrieval content using words 

-to that are familiar to the user, because the dictionary is 
utilized to convert the requested items entered by the 
user to related data items used by different databases 
and database systems so as to retrieve requested infor- 
mation from a plurality of storage locations and different 
database systems, according to the retrieval content 
and retrieval conditions specified by the user, without 
having the user to specify the names for databases or 
their structures. 

The retrieving method, the tetrieving system work- 

so ing in conjunction with the information resource diction- 
ary data to assist effective information retrieval are all 
enabled by a recording medium, having a computer-ex- 
ecutable program, comprising: reference information 
storage means for storing access requirements for ac- 

ss cessing the plurality of different databases: storage lo- 
cation information retrieval means tor retrieving stored 
information in databases which store data to satisfy the 
information retrieval request, from the reference infer- 
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mation storage means: method retrieval means for ob- 
taining instructions relating to acquiring methods for re- 
trieving data to satisfy the information retrieval request 
from the reference information storage means: and in- 
formation retrieval means for determining relevant da- 
tabases according to information obtained by the stor- 
age location information retrieval means, and retrieving 
information to satisfy the information retrieval request in 
conformity with the acquiring method obtained by the 
method retrieval means. 

Accordingly, the recording medium having the com- 
prehensive information retrieval program allows the us- 
er to retrieve desired information from a plurality of stor- 
age locations and database systems by simply specify- 
ing retrieval content and retrieval conditions using words 
familiar to the user, without having to know the names 
for the relevant databases or their structures or without 
having to interact with the meta-information platform. 

In overall summary, it is clear that the present meth- 
od and system of information retrieval, together with the 
essential tools of information resource dictionary and 
enabling operational program : represent a significant 
contribution to the needs of the coming information so- 
ciety by allowing desired information to be retrieved sim- 
ply, effectively and comprehensively from databases in 
any database systems connected to a communication 
network. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic illustration of the principle 
of the information retrieval methodology of the present 
invention. 

Figure 2 is a schematic representation of the prin- 
ciple to enable the retrieval methodology 

Figure 3 is a schematic representation of the sys- 
tem configuration of the information retrieval system. 

Figure 4 is a representation of the structure of infor- 
mation resource dictionary. 

Figure 5 is a representation of the information re- 
trieval request from a user. 

Figure 6 is a flowchart showing a series of informa- 
tion retrieval steps. 

Figure 7 is an example of the information resource 
dictionary in an embodiment of the information retrieval 
methodology. 

Figure 5 is an example of the information retrieval 
request in an embodiment. 

Figure 9 is an example of the table stored in the da- 
taoase in an embodiment. 

Figure 10 is an example of the information retrieval 
statement (SQL) in an embodiment. 

Figure 11 is an example of retrieved results for dis- 
play ; n a spreadsheet A format in an embodiment. 

Figure 1 2 is a representation of the information re- 
source dictionary with additional items. 

Figure 1 3 shows some examples of information re- 
trieval request and information resource aictionary and 
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their inter-relationship. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

5 

First, the principle of the methodology and the sys- 
tem configuration for enabling the methodology will be 
explained with reference to Figures 1 and 2. 

The methodology is described in terms of items to 

10 represent different types of data which are handled by 
the retrieval system: "data items" refer to general data 
which may be stored in any database; "request items or 
requested items" represent data being sought by the us- 
er: "related data items" represent data which are similar 

15 to the "requested items"; and "relation items" represent 
data which link one table to other table(s) through a re- 
lation item, i.e. some common data item(s) found in 
those tables. 

Figure 1 shows the principle of the methodology in 
20 a conceptual block diagram. 

The present invention relates to a method of infor- 
mation retrieval from a plurality of different databases 
which are connected to a communication network (corn- 
network) by generating an information retrieval request 
25. and retrieving information corresponding to the request. 
A series of sequential steps in the method are presented 
below. 

Storing reference information to enable accessing 
30 a plurality of database systems (step 1 ); 

Receiving an information retrieval request (step 2); 

Analyzing the information retrieval request (step 3): 

Determining relevant database systems which 

would contain the requested data in consultation 
35 with reference information (step 4); 

Determining how to acquire the requested data 

(step 5): 

Preparing an information retrieval statement ac- 
cording to acquired information on storage loca- 
te* tions and accessing methods for the database sys- 
tems which would contain the data specified in the 
information retrieval request (step 6); 
Retrieving information from relevant database sys- 
tems satisfying the information retrieval request 
is (step 7): 

Presenting the user with the retrieved information 
(step S). 

An aspect of this method is that the retrieved infci- 
50 mation can be presented to the user in a format con- 
forming to the application software used by the user so 
as to facilitate reading of the retrieved information by the 
user. 

Figure 2 shows an example of a system cenfigura- 
55 tion to enable the principle of the methodology to be 
practiced. 

The present information retrieval system is de- 
signed to operate in an environment comprised of' a 
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communication network: a plurality of database systems 
connected to the communication network; and informa- 
tion retrieval means for receiving an information retrieval 
request from a user and for retrieving information from 
the relevant database systems according to the infor- 
mation retrieval request. The information retrieval sys- 
tem includes components as follows: 

reference information storage means 70 for storing 
information concerning location information for data 
stored in a plurality of different database systems 
30., -80 n and information on accessing methods for 
retrieving the stored data: 

request analysis means 20 for analyzing the infor- 
mation retrieval request input by the information 
searcher; 

storage location information retrieval means 30 for 
acquiring, from the reference information storage 
means 70, information concerning the locations of 
the database systems which store the data to sat- 
isfy the analyzed request: 

method retrieval means 35 having a procedure to 
obtain acquiring methods for retrieving data to cor- 
respond to the analyzed request, from the relevant 
database systems, by consulting the reference in- 
formation storage means 70 : and a procedure to se- 
lect optimum table groups from a combination of re- 
lated data items selected by the request analysis 
means 20: and 

information retrieval means 40 for specifying rele- 
vant database systems to be searched according 
to information obtained from the storage location re- 
trieval means 30, and for retrieving obtained infor- 
mation according to acquiring methods obtained by 
the method retrieval means 35. 

The present information retrieval system is further 
provided with retrieved data presentation means 10 for 
transforming information corresponding to the informa- 
tion retrieval request obtained by information retrieval 
means 40, into a format which can be read by an appli- 
cation software used by the information searcher so as 
to facilitate processing of the acquired information by 
the user. 

The request analysis means 20 also includes item 
conversion means for convening the requested items in 
the information retrieval request into related data item 
names described in the database systems. 

The storage location retrieval means 30 includes a 
procedure for selecting an optimum table grouping from 
a combination of names for the related data items pro- 
duced from the results of analysis performed by the re- 
quest analysis means 20. 

In the conventional systems, a user (information 
searcher) must directly specify items of interest in a me- 
ta-mformation platform, therefore, it is necessary for the 
user to be aware of the system conients. i e. the types 
of database systems connected to the com-networK 



(communication network), and the types of tables con- 
tained in the different database systems. In contrast, in 
the present retrieval system, the retrieval conditions 
specified by the user and the requested items ex- 
s pressed in the retrieval content are analyzed so that us- 
er's words (names for requested items) familiar to the 
user are converted into system words (names for related 
data items) that can be recognized by the relevant da- 
tabase systems. 
io Further by knowing where the information is locat- 
ed, it is possible to select optimum table grouping by 
combining the related data items, to prepare an infor- 
mation retrieval statement (SQL), to execute the SQL, 
to convert the retrieved results into a format suitable for 
'5 reading in the user's environment. Therelore, it be- 
comes possible to process a retrieval request from a us- 
er who does not have system knowledge, such as the 
locations and structures of database systems, by ena- 
bling the retrieval system to prepare a SQL to actually 
20 acquire the information on user's behalf, thereby achiev- 
ing-an objective of the present invention that desired in- 
formation can be obtained, without requiring the user to 
know the locations and structures of different database 
systems connected to a corn-network or the method of 
25 acquiring the information in a particular meta-informa- 
tion platform. 

Next, the structure and operation of the information 
retrieval system of the present invention will be present- 
ed with reference to the drawings. 
30 Figure 3 shows the structure of the information re- 
trieval system. 

The retrieval system operates in an environment 
comprising: an interface section 110: a language anal- 
ysis section 120: an information location retrieval sec- 
35 tion 130: an information retrieval section 140: a middle-, 
wares section 150; an application softwares section 
160: an information resource dictionary 170: and a plu- 
rality of different database systems 1 80. 

The interface section 110 serves as the communi- 
•*o cation interface for the user to enter an information re- 
trieval request. 

The language analysis section 1 20 analyzes the in- 
formation retrieval request input through the interface 
section 110. In analyzing the information retrieval re- 
•*s quest, the retrieval conditions specified by the user and 
the requested items expressed in the retrieval content 
are convened into related data items found in different 
database systems. 

3ased on the results of the language analysis sec- 
so tion 120, the information location retrieval section 130 
determines where the requested items can be found in 
the database systems 180. in consultation with the in- 
formation resource dictionary 170. acauiros the location 
and method of obtaining the data from relevant data- 
55 base systems 1 80. and prepares a SQL so that searcn- 
es can be conducted through the database systems 
i SO Also, the information location retrieval section 130 
obtains necessary information to convert the search re- 
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suits into a format that can be presented to the user and 
transfers the information to the information retrieval sec- 
tion 140. 

The information retrieval section 140 executes the 
SQL. prepared by the information location retrieval sec- 
tion 1 30. searches for the data and converts the search 
results into a format that can be presented to the user 

The information resource dictionary 170 manages 
a dictionary containing all the meta-information on the 
database systems connected to the corn-network as 
well as similarity of data items, and acts as the reference 
resource to the information location retrieval section 
130. 

The middlewares section 150 is used for accessing 
the various database systems 180 connected to the 
corn-network. The retrieved results are displayed using 
commercial application softwares 160. 

Figure 4 shows the structure of the information re- 
source dictionary (IRD)170 containing various Data 
Item Titles (shortened to Items). 

Item 170a of the IRD 170 manages related data 
items, and groups similar related data items into appro- 
priate groups. 

Item 1 70b manages location information of the da- 
tabases 160, and in this example, includes the names 
for the database systems, host names indicating the lo- 
cation of the database systems, and the names for the 
data base management systems (DBMS) used to man- 
age the databases. 

Item 170c manages table information of the data- 
bases, and in this example, includes table titles and da- 
tabase titles. 

Item 1 70d manages column information, and in this 
example, includes column names and data grouping, 
type, length and table titles. 

Item I70e manages information unique to each 
DBMS, and checks tne presence of restrictions for each 
DBMS. 

Item 1 70f manages display information relating to a 
display format for the final retrieved results, including a 
format for expressions, output header information, and 
delimiter information for displaying a plurality of re- 
trieved results. 

Figure 5 shows a structure of the information re- 
trieval request presented by the information searcher. 
The information retrieval request is comprised of retriev- 
al content relating to the kind of information required, 
retrieval conditions -elating to the conditions attached 
to the requested information, and display format (elating 
to the application software being used by the user. 

Figure 6 shows a flowchart for a series of informa- 
tion retrieval steps performed in the present methodol- 
ogy. 

In step 101. the user (information searcher) enters 
an information retrieval request through the interface 
section 110 

In step i 02. ihe language analysis section 1 20 anal- 
yses the retrieval content, retrieval conditions and dis- 



play format in the information retrieval request input 
through the interface section 110. The language analy- 
sis section 1 20 analyses the retrieval content input by 
the user with reference to Item 1 70a of the IRD 1 70, and 
5 divides the requested items into groups of related data 
items of similar meaning to associate the related data 
items with keywords that the system will recognize. 

In step 103 : the information location retrieval sec- 
tion 130 determines relevant database systems which 
io satisfy the retrieval content, on the basis of the analysis 
of the retrieval content performed in step 1 02, in consul- 
tation with Item 170b, 170c and 170dofthe IRD 170. 

In step 104, the information location retrieval sec- 
tion 1 30 determines the method for accessing the spec- 
's jfjed database, on the basis of the analysis of the retriev- 
al conditions performed in step 102, and with reference 
to Item 170e of the IRD 170. 

In step 105, the information location retrieval sec- 
tion 1 30 prepares a SQL for the requested retrieval con- 
20 tent, on the basis of the access location information for 
the specified databases obtained in step 103 and the 
retrieval conditions obtained in step 104. 

In step 106, information retrieval section 140 per- 
forms information retrieval by accessing the specified 
25 databases according to the SQL prepared in step 105. 

In step 107, the retrieved results 170f are converted 
into a format specified by the user on the basis of the 
analysis of the information retrieval request performed 
in step 102, and are transferred to appropriate applica- 
nt? tion softwares 160 through the interface section 110 10 
be presented to the user. 

The execution of the database system will be ex- 
plained in more detail with reference toactual examples. 
An actual example of the database system ISO, 
35 comprised of a sumo-wrestler-data system and a per- 
sonnel-data system, connected to a corn-network will be 
used to illustrate how to obtain specific information con- 
cerning sumo-wrestler(s). 

To respond to inexact or fuzzy inquiries from the us- 
-*o ers, various items including related data items are pre- 
arranged in groups, as shown in Figure 7 : according to 
their similarities, and are stored under Item 170a of the 
information resource dictionary (IRD) 170. In the exam- 
ple shown in Item 170a. various related data items or 
-is words which might be used by the user such as "wres- 
tler name", "sumo wrestler", "earned sumo name" and 
"earned-ranking" are grouped unaer a group name of 
"wrestler names" which serves as a keyword. Other ex- 
amples of related data items, including "born in", "born 
50 in city, district, prefecture of", "born in prefecture of u "na- 
tive of", are grouped under a group name "native of 
which is recognized by the system as a keyword. 

When the words used by the user in the information 
retrieval request are not found in the list, a search is con- 
55 ducted directly in the IRD 170 to look for the columni st 
containing such a word or words. 

Items 1 70b. 1 70c and 1 70d manage information ro- 
tated to database structure Hem 1 70b contains two ca- 



11 



EP 0 829 811 A1 



12 



tabase titles "sumo-wrestler files* 1 and "personnel files", 
and manages the execution programs used in the host 
names and DBMSs. 

Item 1 70c contains table titles for the databases as- 
signed to Item 1 70b. The database named "sumo-wres- 
tler files" manages two table titles, "wrestler names" and 
"boss names" . 

Item !70d manages the column information con- 
tained in the tables assigned to Item 170c. According to 
the listing of tables shown for Item 170d (Figure 7), it 
can be seen that the table name "wrestler' contained in 
Item 170c includes other column names such as "boss 
ID", "wrestler name". "age", "birth date" : "born in prefec- 
ture" and "blood type" 

Item I70e manages description rules for executing 
a retrieval command in each ol the DBMSs 180. In the 
example illustrated. "Orade7" in item 170b is shown by 
"-" to indicate that the DBMS name is accessed with dou- 
ble quotation marks white "Informix" is shown by . 

Item 170f provides information regarding how to 
present the retrieved tesults m the format specified by 
the user. In the illustrated example, if the expression for- 
mat "spreadsheet A " is desired, the header is "Content 
-type: text/excel" and "tab" is used for its delimiter. 

In the above example, it is presumed that the data- 
base manager had registered the meta-information be- 
forehand, for the database systems connected to the 
corn-network, in the IRD 170 in the format as described 
under Items 170a-170f. 

In the following, the operation or the information re- 
trieval system, having the IRD 170 as described above, 
will be described for a case of an information retrieval 
request, shown in Figure 3. input by a user. The user 
enters search conditions indicating that he wishes to: 

[search for sumo-wrestlers who was born in Tokyo 
city and display the results in a spreadsheet A format]. 

The interface section 110 acquires the requested 
information based on the retrieval conditions and re- 
trieval content specified by the user. In this example, the 
system indicates to the user that the keywords which 
have been selected are "wrestler name" based on the 
retrieval content, and "born in" based on the retrieval 
conditions. 

These keywords are forwarded to the language 
analysis section 1 20. and using an item dictionary such 
as the one shown in Item 170a in the IRD 170. the sys- 
tem finds out that for the keyword "wrestler name", such 
data items as "sumo-wrestler", "earned sumo name", 
"earned tanking" which may be found in a database are 
related terms, and for the keyword "born in", such re- 
quested items as "born in u : "born in city district, prefec- 
ture of" and "born in prefecture" are related terms also. 

In the information location retrieval section 1 30, 
these related data items which are included in the "col- 
umn names" are searched in the IRD 1 70. and the result 
is that, as shown in Item I70d. the cata items, "wrestler 
name" ana "born in prefecture" are included in the 
"wrestler" table Proceeding further, the database name 



which contains the "wrestler" table is obtained from Item 
170c, the location of the database from the Item 170b, 
the host name for the table as "datapro", and DBMS as 
"Oracle7". 

s Further on, by knowing that data is managed by a 
DBMS in Item 1 70b : it is found, from the SQL description 
restriction, that for character specification in Oracle/ re- 
quires V\ double quotation marks as indicated in Item 
170e. 

to The process to this point has established that the 
relevant table for the retrieval content is to be found in 
the 'wrestler" table, and that the SQL can be prepared 
by selecting the "wrestler name" including a search con- 
dition 11 born in prefecture ="Tokyo° ", so that the SQL in 

is Figure 10 reads: 

DB1 (sumo-wrestler files DB); select wrestler 
name from wrestler where born in prefecture ="Tokyo" 
The SQL is now forwarded to the middleware sec- 
tion 150 so that the DBMS can execute searches, i.e., 

20 the SQL is booted up against the target database sys- 
tem 180. The database system 180 contains the table, 
shown in Figure 9, so that searches can be performed 
in the wrestler table to select a wrestler name "Takano- 
hana" whose born in prefecture is Tokyo. 

25 When the information retrieval section 140 thus re- 
ceives the search results, they are transferred to the in- 
terface section 110, where they are converted to a for- 
mat to correspond with the input condition for "display 
format"=spreadsheet A, and are displayed on the user's 

30 monitor screen. 

Accordingly, the retrieval result shown in Figure 11 
is displayed. 

The procedure presented above is acceptable 
when the information specified in the information retriev- 
es al request is found in one common table. However, when 
the specified information is found in a plurality of tables 
in a database system or database systems, it is neces- 
sary to have some method for relating these tables so 
that an optimum table grouping can be generated. This 
-to is because, the SQL presented above requires not only 
the location of information but also the method for re- 
trieving the information to be specified. 

Therefore, another Item shown in Figure 12 must 
be added to the IRD 170. Item 170gof the IRD 170 man- 
-ls ages information to link different tables, and is com- 
prised of "relation name" for relating the contents of dif- 
ferent tables; "primary reference DB" for indicating the 
database which stores the original table to be related to: 
"primary reference table" for indicating the original table: 
50 "primary reference column" for indicating the column 
name in the original reference table: "reference DB" for 
indicating the name of a database containing relevant 
tables: "reference table" for indicating a related tabic 
containing the requested items: and "reference column" 
55 for indicating the column name m the related table. 

Here, the method of using the related information 
presented in Figure 12 will be illustrated with reference 
;o a SQL 190 shown tn Figure 13. i e. an information 
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retrieval request which reads: 

[search for the names for sumo-wrestlers who was 
born in Tokyo and the names for their bosses, and show 
the results in a spreadsheet A]. 

The interface section 110 obtains the information to 
be retrieved from the retrieval conditions and the retriev- 
al content. The retrieval content indicates that "wrestler 
name" and "boss name' are keywords, and the retrieval 
conditions indicate that "born in" is a keyword. 

These keywords are forwarded to the language 
analysis section 120, which performs item analysis on 
each of the keywords : using an item dictionary simitar 
to the Item 1 70a in the I RD 1 70, and obtains information 
that for the keyword "wrestler name" , such data items 
as "sumo-wrestler", "earned sumo name", "earned rank- 
ing" are related terms, and for the keyword "born in" : 
such data items as "born in u : "born in city, district, pre- 
fecture of", and "born in prefecture" are related terms. It 
should be noted that "boss name" is not in Item 170a 
meaning that it must be searched for directly in Item 
170d. 

The information location section 130 searches for 
data items similar to "wrestler name" and "born in" as 
well as tor data term "boss name" which are contained 
in the "column names" in the 1RD 1 70. The result is that 
"boss none" , "wrestler name" and 'born in prefecture" 
are found within the column names, as indicated in Item 
170d in Figure 7. 

Here : it should be noted that, from Item I70d, the 
table name containing the column names "wrestler 
name" and "born in prefecture" is the "wrestler" table, 
and the table name containing the column name "boss 
name" is the "boss" table. Therefore, the system now 
knows that the information specified in the information 
retrieval request does not exit in the same table, there- 
fore, to execute the information retrieval request, it is 
necessary to obtain the information from these two ta- 
bles. This process is performed by extracting the condi- 
tions necessary to link the tables by utilizing the related 
information presented in Item 170g. 

To extract the conditions for linking the tables, first, 
the system examines whether there is a column name 
to relate the wrestler table with the boss table. This is 
achieved by searching through the reference table titles 
and priman/ reference table titles to find a matching set 
of related cata names, if there is information defining a 
relation between the tables, the primary reference col- 
umn name and the reference column name are extract- 
ed, and are considered to be the conaitions for linking 
the tables. 

In this example, the wrestler table and the boss ta- 
ble are related through a relation item "group" and the 
two tables are related through a column name "boss ID". 
It may be noted that, in the information retrieval state- 
ment ( SOL), this search condition is reflected by :ndicat- 
mg "wrestler boss lD=boss.boss ID". 

Proceeding on the database name "sumo-wrestler 
files" containing the tables "boss" and "wrestler" is ob- 



tained from Item 170c. and the host name "datapro is 
obtained from Item 1 70b as the location of the database 
and "Oracle7" as its DBMS. 

From Item 170b containing DBMS name "Oracle7" 

5 and information regarding the description restriction for 
SQL, shown in Item 170e : are obtained. 

The process to this point has established that the 
retrieval content relates to search tables "wrestler' and 
"boss" tables, and 'wrestler name, boss name' should 

w be searched under the search conditions "born in pre- 
fecture ="Tokyo" and wrestler boss ID=boss.boss ID" . 
This information is forwarded to the information retrieval 
section 140 which prepares a SQL to read: 

DB1 (sumo-wrestler files DB); select wrestler 

ts name, boss name from wrestler, boss, where born in 
prefecture =Tokyo and wrestler. boss lD=boss.boss ID 

The SQL is now forwarded to the middleware sec- 
tion 1 50 so that the DBMSs can execute searches, and 
the boss table and the wrestler table, such as those 

20 shown in Figure 9, are retrieved. The results show the 
wrestler name "Takanohana" from the wrestler table, 
and "Futagoyama" from the boss table. 

The retrieved results are converted into a format to 
correspond with the input condition for "display for- 

25 mat"=spreadsheet A, that is, in a format which can be 
read by the application software of the user and are dis- 
played on the user's monitor screen through the appli- 
cation software. 

It should be noted that, in the examples presented, 

30 those programs for executing the request analysis 
means 20; storage location information retrieval means 
30: method retrieval means 35: information retrieval 
means 1 40: retrieval information presentation means 10 
shown in Figure 2 as well as other programs for execut- 
es ing various processing sections shown in Figure 3 can 
be recorded on a recording medium that can be read 
into a computer which is used to conduct information 
retrieval from a plurality of different databases connect- 
ed to a corn-network. 

-to it is obvious that the foregoing examples are meant 
to be illustrative, and do not restrict in any way the ap- 
plicability of the method and the system. The principles 
is that results of information retrieval can be made more 
comprehensive by having own programs within the sys- 

-*5 tern to unaerstand how to use various DBMSs connect- 
ed to a network, can be modified to suit a wide variety 
of information retrieval applications within the range of 
claims presented in the following. 

so 

Claims 

1. A method for retrieving requested information from 
different database systems connected to a commu- 
ss nication network, by generating an information re- 
trieval request and retrieving information to satisfy 
said information retrieval request, comprising the 
steps of' 
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storing reference information beforehand to ac- 
cess a plurality of daiabase systems; 
consulting said reference information upon re- 
ceiving said information retrieval request: 
determining a database system which contains 5 
data to satisfy said information retrieval request 
as well as a storage location of said data, and 
obtaining instructions relating to an acquiring 
method for retrieving data from said database 

w 

system; 

preparing an information retrieval statement in 
conformity with said storage location and said 
acquiring method; 

searching through said database system for re- 
quested information according to said informa- 15 
tion retrieval statement and said acquiring 
method: and 

presenting retrieved information to an informa- 
tion searcher. 

20 

A method according to claim 1, wherein, upon re- 
ceiving said information retrieval request, said in- 
formation retrieval request is analyzed, in consulta- 
tion with said reference information, to convert re- 
quested items specified in said information retrieval 
content into related data items, so as to determine 
a database system which contains said related data 
items corresponding to said requested items, and 
to obtain said acquiring method for obtaining data 
from said database system. 

A method according to claim 1 . wherein, when stor- 
age locations for said related data items similar to 
said requested items are distributed over a plurality 
of storage locations, relation items are utilized for 35 
linking said storage locations in said plurality of da- 
tabases, in consultation with said reference infor- 
mation, so as to determine conditions for linking 
said plurality of storage locations. ^ 

A method according to claim 1, wherein presenta- 
tion of retrieved information is conducted according 
to a display format specified in said information re- 
trieval reauest by converting retrieved information 
into a format which can be read by an application 
software being used by said information searcher. 

A method for retrieving requested information from 
a plurality of different database systems connected 
to a communication network by generating an infor- *o 
nation retrieval request ana retrieving information 
to correspond to said infcrmation retrieval request, 
comprising the stops of: 

storing reference information beforehand to ac- ss 
cess a olurality of database systems: 
consulting saic reference information uoon re- 
ceiving said information retrieval request: 
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analyzing said information retrieval request so 
as to convert requested items specified in an 
information retrieval content into related data 
items: 

determining a database system which contains 
said related data items and a storage location 
of said related data items according to analysis 
results, and, when said related data items are 
distributed over a plurality of storage locations, 
obtaining relation items for linking a plurality of 
storage locations, and determining conditions 
for linking said plurality of storage locations: 
specifying said database system including stor- 
age locations; 

obtaining instructions relating to acquiring 
methods for retrieving data from said database 
system; 

preparing an information retrieval statement in 
conformity with said storage location and said 
acquiring method; 

searching through said database system ac- 
cording to said information retrieval statement; 
and 

converting retrieved results to conform to a dis- 
play format information included in said infor- 
mation retrieval request so as to enable re- 
trieved results to be read using an application 
software used by an information searcher. 

An information retrieval system operating in an en- 
vironment comprised of a communication network, 
a plurality of different database systems connected 
to said communication network, and retrieval 
means for retrieving information in response to an 
information retrieval request entered by an informa- 
tion searcher said system comprising: 

reference information storage means for stor- 
ing access requirements for accessing said plu- 
rality of different database systems; 
storage location information retrieval means for 
retrieving information, concerning database lo- 
cations, database structures and database for- 
mat to satisfy said information retrieval request, 
from said reference information storage 
means. 

method retrieval means for obtaining instruc- 
tions relating 10 acquiring methods for retriev- 
ing information specified in said information re- 
trieval request from relevant database systems 
in consultation with said reference information 
storage means: and 

information retrieval means for determining rel- 
evant database systems according to informa- 
tion obtained from said storage location infor- 
mation retrieval means, and. in conform.ty with 
said acquiring methods, for retrieving informa- 
tton specified in an information retrieval content 
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included in said information retrieval request. 

7. An information retrieval system according to claim 
6, wherein said information retrieval system is fur- 
ther provided with request analysis means for ana- s 
lyzing information retrieval request and for convert- 
ing requested items specified in said information re- 
trieval request into related data items which corre- 
spond to said retrieval content included in said in- 
formation retrieval request. w 

8. An information retrieval system according to claim 
6 ; wherein, when said related data items are distrib- 
uted over a plurality of storage locations, said stor- 
age location information retrieval means utilize said »5 
reference information storage means to obtain re- 
lation items for linking said plurality of storage loca- 
tions which contain data specified in said informa- 
tion retrieval request so as to determine conditions 

for linking said plurality of storage locations. 20 

9. An information retrieval system according to claim 
6 : wherein said information retrieval system is fur- 
ther provided with retrieved information presenta- 
tion means for converting retrieved results obtained 25 
by said information retrieval means into a display 
format which can be read by an application software 
being used by an information searcher. 

10. An information retrieval system operating in an en- 30 
vironment comprised of a communication network, 

a plurality of different database systems connected 
to said communication network, and retrieval 
means for retrieving information in response to an 
information retrieval request entered by an informa- 35 
tton searcher said system comprising: 

reference information storage means having 
access information for accessing said plurality 
of different database systems; 40 
request analysis means for analyzing said in- 
formation retrieval request and converting re- 
quested items specified in an information re- 
trieval content to related data items, in consul- 
tation with said reference information storage -*s 
means: 

storage location information retrieval means for 
determining that stored information in a data- 
base system contains said related data items 
derived by said request analysis means, and. so 
when said related data items are distributed 
over a plurality ol storage locations, for ena- 
bling said storage location information means 
to obtain, in consultation with said reference in- 
formation storage means, relation items to link ss 
said plurality of storage locations, including 
conditions for linking said plurality of storage lo- 
cations: 



method retrieval means for obtaining instruc- 
tions relating to acquiring methods for retriev- 
ing information specified in said information re- 
trieval request from said reference information 
storage means: and 

information retrieval means for determining rel- 
evant database systems in accordance with in- 
formation obtained from said storage location 
information retrieval means, and for retrieving 
related data items to correspond to said re- 
quested items specified in said information re- 
trieval content in conformity with said acquiring 
method; 

converting retrieved results, according to a dis- 
play format included in said information retriev- 
al statement so as to enable retrieved results 
to be read using an application software used 
by said information searcher. 

11 . A recording medium, readable by computer means: 
having information resource dictionary data record- 
ed for use in retrieving information from a plurality 
of different database systems connected to a com- 
munication network, by generating an information 
retrieval request containing requested items: 
wherein, 

said information resource dictionary data are 
recorded in: a column information file for man- 
aging columns of requested items specified in 
tables: a table information file for managing ta- 
bles contained in each database: a database 
information file for managing locations of each 
database: and a database management sys- 
tem file, known as a DBMS file, for managing 
dedicated information for methods of acquiring 
requested items from each database: and 
wherein 

said column information file has an allocation 
for recording column names contained in tables 
in relation to table titles so as to relate column 
names to requested items specified in said in- 
formation retrieval request: 
said table information file has an allocation for 
recording table titles contained in databases in 
relation to database titles so as to relate said 
column information file to said table information 
file through table titles; 

said database information file has an alloca- 
tion, for each of databases connectea to said 
communication network, for recording data- 
base titles, host names indicating locations of 
databases and DBMS names for each data- 
base being managed by a DBMS, so as to re- 
late said table information file and said data- 
base information file through database titles: 
and 

said dedicated DBMS file has an allocation for 



10 



19 



EP0 829 811 A1 



20 



each of DBMSs, for recording DBMS names in 
relation to dedicated information for each 
DBMS, so as to relate said database informa- 
tion file and said dedicated DBMS file through 
database titles. 5 

12. A recording medium, readable by computer means, 
having information resource dictionary data record- 
ed for use in retrieving information from a plurality 
of different database systems connected to a com- io 
munication network, by generating an information 
retrieval request containing requested items; 
wherein, 

said information resource dictionary data are 15 
recorded in: a related data items file for man- 
aging similar requested items in groups; a col- 
umn information file for managing requested 
items located in columns comprising tables: a 
relation information file for managing column 20 
names for linking tables; a table information file 
for managing tables contained in each data- 
base; a database information file for managing 
locations of each database: a database man- 
agement system file, known as a DBMS file, for 2s 
managing dedicated information for methods of 
acquiring requested items from databases: and 
a results display file for managing display meth- 
ods for retrieved results: wherein 
said related data items file has an allocation for 30 
recording related data items to represent re- 
quested items so as to relate said related data 
items to requested items specified in said infor- 
mation retrieval request 

said column information file has an allocation 35 
for recording column names comprising tables 
in relation to table titles so as to relate said re- 
lated data items file to said column information 
file through related data items in said related 
data items file and columns in said column in- -to 
formation file; 

said relation information file has an allocation 
for recording each table name in relation to col- 
umn names, in order to manage column names 
which link relevant tables, so as to relate said J $ 
column information file to said relation informa- 
tion file through table titles: 
said table information file has an allocation for 
recording table titles contained in databases in 
relation to database titles so as to relate said 50 
column information file to said tabie information 
file through table titles; 

said database information file has an alloca- 
tion, for each of aatabases connected to said 
communication network, for recording data- 55 
base titles, host names inaicating locations of 
databases and DBMS names for each data- 
base being managea by a DBMS, so as to re- 



late said table information file and said data- 
base information file through database titles; 
and 

said dedicated DBMS file has an allocation, for 
each of DBMSs, for recording DBMS names in 
relation to dedicated information for each 
DBMS : so as to relate said database informa- 
tion file and said dedicated DBMS file through 
database titles; and 

said results display file has an allocation for re- 
cording application software information in re- 
lation to format information for reading said ap- 
plication softwares, and said application soft- 
ware information includes an application soft- 
ware being used in said information retrieval re- 
quest. 

1 3. A recording medium, having an information retrieval 
program executable by computer means, for re- 
trieving information from a plurality of different da- 
tabase systems connected to a communication net- 
work, by generating an information retrieval request 
and obtaining information to satisfy said information 
retrieval request, comprising: 

reference information storage means for stor- 
ing access requirements for accessing said plu- 
rality of different database systems; 
storage location information retrieval means for 
retrieving stored information in database sys- 
tems, which store data to satisfy said informa- 
tion retrieval request, from said reference infor- 
mation storage means. 

method retrieval means for obtaining instruc- 
tions relating to acquiring methods for retriev- 
ing data to satisfy said information retrieval re- 
quest from said reference information storage 
means: and 

information retrieval means for determining rel- 
evant database systems according to informa- 
tion obtained by said storage location informa- 
tion retrieval means, and retrieving information 
to satisfy said information retrieval request in 
conformity with said acquiring method obtained 
by said method retrieval means. 

14. A recording medium, having an information retrieval 
program executable by computer means, for re- 
trieving information from a plurality of different da- 
tabase systems connected to a communication net- 
work, by generating an information retrieval request 
and obtaining information to satisfy said information 
retrieval request, comprising: 

reference information storage means for stor- 
ing access requirements for accessing said plu- 
rality of different database systems: 
request analysis means for analyzing said m- 
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formation retrieval request and convening re- 
quested items specified in said information re- 
trieval request to related data items, in consul- 
tation with said reference information storage 
means: s 
storage location information retrieval means for 
determining that stored data in database sys- 
tems contain said related data items derived 
from said request analysis means, and, when 
said related data items are distributed over a w 
plurality of storage locations, for enabling said 
storage location information means to obtain 
relation items, using said reference information 
storage means, for linking said plurality of stor- 
age locations, including conditions for linking is 
said plurality ol storage locations: 
method retrieval means for obtaining instruc- 
tions relating to acquiring methods for retriev- 
ing data to satisfy said information retrieval re- 
quest, in consultation with said reference infor- 20 
(nation storage means; and 
information retrieval means for determining rel- 
evant database systems according to informa- 
tion obtained by said storage location informa- 
tion retrieval means, and retrieving information 25 
corresponding to said information retrieval re- 
quest in conformity with said acquiring methods 
obtained by said method retrieval means: 
converting retrieved results, according to a dis- 
play format included in said information retriev- 30 
al statement so as to enable retrieved results 
to be read by an application software being 
used by an information searcher. 
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